Attentive Feature Augmentation for Long-Tailed Visual Recognition
نویسندگان
چکیده
Deep neural networks have achieved great success on many visual recognition tasks. However, training data with a long-tailed distribution dramatically degenerates the performance of models. In order to relieve this imbalance problem, an effective Long-Tailed Visual Recognition (LTVR) framework is proposed based learned balance and robust features under circumstances. framework, plug-and-play Attentive Feature Augmentation (AFA) module designed mine class-related variation-related original samples via novel hierarchical channel attention mechanism. Then, those are aggregated synthesize fake cope dataset. Moreover, Lay-Back Learning Schedule (LBLS) developed ensure good initialization feature embedding. Extensive experiments conducted two-stage method verify effectiveness both learning classifier rebalancing in image task. Experimental results show that, when trained imbalanced datasets, achieves superior over state-of-the-art methods.
منابع مشابه
Attentive Visual Recognition for Scene Exploration
Vision is an active process where behaviorally important information is selectively gathered. We present a model of scene exploration in which the high-resolution fovea is deployed to interesting regions by selective attention processes. The system switches between exploratory and recognition modes such that in exploration mode, regions of interest are investigated and in recognition mode, a gi...
متن کاملAttentive Tracking Disrupts Feature Binding in Visual Working Memory.
One of the most influential theories in visual cognition proposes that attention is necessary to bind different visual features into coherent object percepts (Treisman & Gelade, 1980). While considerable evidence supports a role for attention in perceptual feature binding, whether attention plays a similar function in visual working memory (VWM) remains controversial. To test the attentional re...
متن کاملLearning Convolutional Feature Hierarchies for Visual Recognition
We propose an unsupervised method for learning multi-stage hierarchies of sparse convolutional features. While sparse coding has become an increasingly popular method for learning visual features, it is most often trained at the patch level. Applying the resulting filters convolutionally results in highly redundant codes because overlapping patches are encoded in isolation. By training convolut...
متن کاملPre-attentive visual selection
In this note, we consider the critical function of the selection from an entire visual scene of visual locations or objects for detailed or attentive processing. Such selection of some places at the expense of others is necessary because attention has only a meagre capacity (estimated at just 40 bits/second; Sziklai 1956). The same bottleneck implies that the selection process itself cannot gen...
متن کامل10 Attentive Visual
A panel of computer vision researchers recently convened and produced a document concerning the relatively new eld of \Active Vision" 36]. Although there is no precise deenition of what active vision is, most researchers would agree that active vision is concerned with controlling camera parameters, such as position, focal length, aperture width, and so on, in ways that serve to make vision pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2022
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2022.3161427